Picture for Shanghang Zhang

Shanghang Zhang

Demo-JEPA: Joint-Embedding Predictive Architecture for One-shot Cross-Embodiment Imitation

Add code
May 20, 2026
Viaarxiv icon

World-Ego Modeling for Long-Horizon Evolution in Hybrid Embodied Tasks

Add code
May 19, 2026
Viaarxiv icon

Dexora: Open-source VLA for High-DoF Bimanual Dexterity

Add code
May 18, 2026
Viaarxiv icon

SceneParser: Hierarchical Scene Parsing for Visual Semantics Understanding

Add code
May 14, 2026
Viaarxiv icon

HarmoWAM: Harmonizing Generalizable and Precise Manipulation via Adaptive World Action Models

Add code
May 11, 2026
Viaarxiv icon

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

Hi-WM: Human-in-the-World-Model for Scalable Robot Post-Training

Add code
Apr 23, 2026
Viaarxiv icon

Mask World Model: Predicting What Matters for Robust Robot Policy Learning

Add code
Apr 22, 2026
Viaarxiv icon

HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation

Add code
Apr 09, 2026
Viaarxiv icon

ConceptWeaver: Weaving Disentangled Concepts with Flow

Add code
Mar 30, 2026
Viaarxiv icon